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Abstract —Technology of autonomous vehicles (AVs) is getting 
mature and many AVs will appear on the roads in the near future. 
AVs become connected with the support of various vehicular 
communication technologies and they possess high degree of 
control to respond to Instantaneous situations cooperatively with 
high efficiency and flexibility. In this paper, we propose a new 
public transportation system based on AVs. It manages a fleet 
of AVs to accommodate transportation requests, offering point- 
to-point services with ride sharing. We focus on the two major 
problems of the system: scheduling and admission control. The 
former is to configure the most economical schedules and routes 
for the AVs to satisfy the admissible requests while the latter is 
to determine the set of admissible reqnests among all reqnests to 
produce maximum profit. The scheduling problem is formnlated 
as a mixed-integer linear program and the admission control 
problem is cast as a bilevel optimization, which embeds the 
scheduling problem as the major constraint. By utilizing the 
analytical properties of the problem, we develop an effective 
genetic-algorithm-based method to tackle the admission control 
problem. We validate the performance of the algorithm with 
real-world transportation service data. 

Index Terms —Autonomous vehicle, admission control, bilevel 
optimization, smart city. 


I. Introduction 

H uman mobility is largely supported by public transport. 

Many people rely on public transport to move from 
one place to another when the destinations of their journeys 
are not within walkable distances. To transform a city with 
limited room for large-scale infrastructure into a smart city, its 
public transportation system may need to be further upgraded 
mainly from the existing road networks. Representatives of 
road-based public transport are buses and taxis, each type 
of which has its pros and cons. In general, buses follow 
hxed routes offering shared ride so that more passengers can 
be served on each single journey. On the other hand, taxis 
offer private services and run on flexible dedicated routes 
based on the passengers’ requests. Nevertheless, no single 
one type can support high throughput and flexibility at the 
same time. The efficiency and capacity of the whole public 
transportation system may be enhanced if there exists a new 
public transport which can accommodate many people in a 
short period of time and concur high mobility. It may maintain 
flexibility by offering point-to-point services while enhancing 
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efficiency by supporting shared ride. Such kind of public 
transport requires several characteristics which may not be 
possessed by a typical public transport. To develop such a 
public transport, the vehicles need to cooperate to take up 
customers’ requests instead of cruising around the city for 
random offers. To enhance the efficiency and cooperativeness, 
a control center can be employed to coordinate all the vehicles, 
manage all the service requests, and assign the vehicles to 
serve the requests. Moreover, the vehicles should follow the 
routes and carry out the travel plans instructed so as to achieve 
system-wise objectives. Recently, autonomous vehicles (AVs) 
have been undergone active research and we can expect 
many AVs running on the roads in the near future. The 
AV is a good candidate possessing most of the requirements 
mentioned above. Hence AVs can be adopted to construct a 
new smart public transportation system with high efficiency 
and flexibility. 

In this paper, we introduce an intelligent AV-based public 
transportation system. It manages a fleet of AVs to accommo¬ 
date transportation requests, offering point-to-point services 
with ride sharing. We focus on two important problems in 
the system: scheduling and admission control. The former is 
about how to assign the designated vehicles to the admissible 
transportation requests, and when and where the vehicles 
should reach to provide services with the lowest cost. The 
latter is to determine the set of admissible requests among 
all requests to achieve maximum revenue. As a whole, the 
contributions of this paper include: 


• proposing the AV public transportation system; 

• improving the model for scheduling proposed in H], such 
that the formulation developed in this paper can now 
support both directed and undirected graphs; 

• developing distributed scheduling; 

• formulating the admission control problem; 

• introducing the concept of admissibility and the related 
analytical results; 

• designing an effective method to solve the admission 
control problem; and 

• validating the performance of the solution method with 
real-world transportation service data. 


The rest of this paper is organized as follows. Related work is 
given in Section |I^ and we present various system components 
and their operations in Section III The scheduling problem 
is discussed in Section |IV] In Section |V] we formulate the 
admission control problem and provide the related analytical 
results. We propose a genetic-algorithm-based solution method 
for admission control and develop distributed scheduling in 
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Section|V2 Section |VII| evaluates the system performance with 
real-world transportation service data. Finally we conclude this 


paper in Section VIII 


II. Related Work 

The concept of AVs was raised in the 1920’s and the 
research thereof has started for more than thirty years. An 
AV is equipped with many sensors, which provide the vehicle 
with full sensing ability so as to adapt to the neighborhood 
environment and realize fully automated control. In 2007, 
the DARPA Urban Challenge boosted the awareness of AVs 
capable of being driven in traffic and performing complex 
maneuvers Q. In 2010, VisLab carried out the experiment 
that several driverless vehicles successfully traveled 13,000 km 
from Italy to China 0. Google demonstrated an AV prototype 
in 2011 in. By the end of 2013, several states in the United 
States, including Nevada, Florida, California, and Michigan, 
had passed the law to allow AVs running on public roads 
0. The first self-driving shuttle on sale was from NAVIA 
0. Other automotive manufacturers, like Mercedes-Benz 0, 
BMW, and Audi 0, have invested in self-driving technologies 
and include AVs in their production plans. 

Most research work on AVs mainly focused on the con¬ 
trol and communication aspects. Mladenovic and Abbas a 
proposed a self-organizing and cooperative control framework 
for distributed vehicle intelligence. Hu et al. cni studied lane 
assignment strategies for connected AVs and proposed a lane 
changing maneuver to balance the tradeoff between efficiency 
and safety. Petrov and Nashashibi im developed a feedback 
controller for autonomous overtaking without utilizing road¬ 
way marking and inter-vehicle communication. Li et al. in 
presented a multi-level fusion-based road detection system for 
driverless vehicle navigation to ensure safety in various road 
conditions. All these show that AV is a promising technology 
with the support from governments, high-tech companies, and 
car manufacturers. 

Vehicles can communicate with each other and fixed infras¬ 
tructure via various vehicular wireless communication tech¬ 
niques m. Nowadays vehicular communications are mostly 
deployed over satellite, cellular networks, and vehicular ad- 
hoc networks (VANETs) lfT3l . VANET is a mobile ad-hoc 
network where vehicles act as the mobile nodes HI and it 
can improve the communication capacity and organization of 
AVs constituting an intelligent transportation system. Eurda et 
al. Ha introduced a wireless communication framework for 
driverless vehicles. It facilitated vehicle-to-vehicle and vehicle- 
to-infrastmcture communications and improved the safety and 
efficiency of vehicles. Alsabaan et al. ifThl made use of traffic 
light signals and vehicle-to-vehicle (V2V) communications 
to help vehicles adapt their speeds and avoid unnecessary 
stop, acceleration, and excessive speed. Gomes et al. H3 
designed a driver-assistance system which allowed a vehicle 
to collect real-time camera images from other vehicles in the 
neighborhood over V2V communications. In this way, AVs 
become connected and can communicate with the control 
center. 

Shareability of taxi services has been studied recently. 
Santi et al. na investigated the tradeoff between passenger 


TABLE I 

Contributions to the system. 


Technology/ 

feature 

Example 

Contributions 

Ref. 


VisLab 

Demonstrate the feasibility 
of AVs 

m 

Hai'dwai'e 

Google 

Show the confidence of the 
industry in AVs 

ID 


Mercedes- 

Guarantee supply of AVs 

0. (I), 0 


Benz, BMW, 
Audi, NAVIA 

for the system 



Mladenovic & 

Enhance self-organizing and 

El 


Abbas 

cooperative control of AVs 


Software 

Hu et al. 

Balance the efficiency and 
safety of AVs 

Qo) 


Petrov & 

Nashashibi 

Enhance self-control of AVs 

im 


Li et al. 

Improve safety of AVs 

El 


Nevada, 

Demonstrate the support of 

m 

Law 

Florida, 
California, 
and Michigan 

governments 



Cottingham 

Introduce the vehicular 
wireless communications 
available to be used in the 
system 

GD 


Dahiya & 

Improve the communication 

m 


Chauhan 

capacity and organization of 
AVs 


Communicatio 

isFurda et al. 

Enhance the communica¬ 
tions between AVs and the 
control center 

E) 


Alsabaan et al. 

Improve the comfort of AVs 

(HI 


Gomes et al. 

Collect data for the system 
to estimate traffic conditions 

El 

Ridesharing 

Santi et al. 

Ma et al. 

Confirm the ridesharing 
functionality of the system 

□a 

El 

AV public 
transporta¬ 
tion 
system 

Lam et al. 

Lam et al. 

Provide a proof of concept 
Investigate the scheduling 
and admission control prob¬ 
lems 

ID 

This work 


inconvenience and collective benefits of sharing and concluded 
that a small increase in discomfort could induce the significant 
benefits of less congestion, less running costs, less split fares, 
less polluted, and cleaner environment. Ma et al. proposed 
a taxi ridesharing system called T-Share in HU, where the 
dynamic taxi ridesharing problem was studied. Eor a dataset 
of taxi services in Beijing, it showed that 25% additional 
taxi users could be served with saving of 13% of total travel 
distance. These studies confirmed that ridesharing is beneficial 
but they mostly focused on taxi services. In this paper, we 
focus on AVs, which have a key intrinsic property hardly 
found in the standard taxis: the direct control of vehicles 
does not involve any human factors. In other words, AVs can 
completely follow the instructions from the control center in 
the sense that they neither undertake any unassigned requests 
nor reject any assigned requests. We can see that AVs can 
fully cooperate to achieve the system objective but it may not 
be the case for human-driving taxis. 

The AV public transportation system is uniquely designed 
and it can help improve the capacity and flexibility of the 
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future transportation system. To further demonstrate its feasi¬ 
bility, we show how the existing work discussed above may 
contribute to the system in Table |I] 

The scheduling problem has been introduced in H] and it 
can be considered as a variant of the Dial-A-Ride Problem 
(DARP) II 20 I . However, in our AV scheduling problem, we 
allow modifying the previously assigned but not yet served 
requests at desirable times to achieve system-wise performance 
goal. When the system evolves, the AVs appear at different 
locations at different time instants. It may happen that a 
particular request can be better served by a different AV at 
different times. Consider an example with two AVs, I and 
II. At a paricular time, AV-I is in the neighborhood of a 
location while AV-II is not. A request originated from this 
location may be better served by AV-I. After some time, 
AV-I may have gone away but AV-II may have come into 
the neighborhood. Then the request may be better served by 
AV-II instead. As the AVs are connected through appropriate 
vehicular communication technologies, the schedules of AVs 
can be revised from time to time. We consider this in our 
formulation making our scheduling problem different from 
DARP. As the system involves a number of AVs, determining 
their schedules in a distributed manner can undoubtedly speed 
up the process. Distributed scheduling has been advanced in 
many engineering disciplines, e.g., communication networks 
ED, ED. As a new system, we will dedicatedly design a 
distributed methodology for the scheduling thereof. 

Admission control generally refers to a validation process 
in communication systems for quality-of-service assurance. It 
determines which new connection or service request can be 
granted with resources for subsequent operations. For example, 
Il2^ designed an admission control mehanism to add or drop 
session requests in 4G wireless networks and Il24l discussed 
various admission control algorithms for multi-service IP 
networks. We adopt this idea in the transportation system 
and design an admission control mechanism to differentiate 
the transportation service requests for maximizing the total 
profit. There are many methods to facilitate admission control. 
Genetic Algorithm (GA) is one of them and it has been 
successfully utilized to design admission control mechanisms, 
e.g., ESl and 1^ . Based on the special formulation of the 
admission control problem (to be discussed in Section |V]l, we 
will also adopt GA to solve the problem. 

III. System Model 

In this section, we design the architecture for the system 
which can manage a fleet of AVs to serve customers for 
transportation services. In the following, we first introduce 
the system components and then describe the operations 
characterizing their interactions. 

A. System Components 

1) Network Structure: A graph is employed to model 
the region being served by the system. It characterizes the 
locations and the road connections necessarily to describe 
movements of the AVs, origins and destinations of the service 
requests, and other required facilities. It is a directed graph 


denoted by G{y,8), where V is a set of locations and S 
refers to the road segments connecting the locations so that 
we can completely describe the routes of AVs with G. For 
i,j G V, each edge (i,J) G £ is associated with an operational 
cost Cij and a travel time which is an estimation of time 
for an AV to traverse from i to j based on historical data. 
Depended on the system objective, Cij typically represents the 
distance of the road segment {i,j) as the operational cost of 
AVs is usually measured by the fuel consumption which is in 
turn characterized by the travel distance. If the system aims 
to optimize the total service duration, we can set = tij for 


all (i,j)’s. We allow Ci 


^ Oji and tij 


^ tji 


to account for 


the asymmetry of road segments. Moreover, refuel stations are 
located in some locations specified by V C V and each AV 
ends its journey at any one of these refuel stations (reasons 
explained in Section III-B1| |. Based on the nature of the AVs, 
V C V will be the locations of charging (gas) stations if 
the AVs are electric (conventional) vehicles. For the case of 
electric vehicles, V C V can be determined based on the 
charging demand and the connectivity of the charging station 
network according to ED- 

2) Transportation Requests: Customers request services in 
the form of transportation requests, which are collectively 
denoted by TZ. Each r G TZ is represented by the 5-tuple 
{sr,dr,Tr,[er,lr],qr)- Sr G V and dr G V represent the 
customer pickup and dropoff locations, respectively. Tr is the 
maximum ride time, an exceedance of which will lead to 
customer dissatisfaction. [er,lr] refers to the service starting 
time window, where and Ir are the earliest and latest service 
starting times, respectively, qr stands for the number of seats 
needed in the request r. 

3) Vehicles: The system coordinates a fleet of AVs de¬ 
noted by JC. Each k G K, is represented by the 5-tuple 
(o/t, Tfc, Qfc, Qk G V is the first location where k 
will visit from the current position of k while is the time 
required to reach Uk from its current position. It is possible 
that, at the time of scheduling, the AV is in the middle 
of a road segment heading to Ofc. ak and can be easily 
estimated by submitting its current position to the system. Tk 
denotes the maximum remaining operation time that k can 
continue to provide services without refuelingj^ Qk is the 
passenger capacity that k can accommodate simultaneously. 
TZk — TZk U TZk G TZ is the set of requests previously 
assigned to k. TZk can be further categorized into two types; 
TZk contains those currently being served by k while TZk was 
assigned to fc at a previous schedule but the services have 
not been implemented yet. Eor the former, some seats have 
already been taken by the customers from TZk- On the contrary, 
seats have only been reserved but no actual seats have been 
taken from TZk- We will handle TZk and TZk differently when 


performing scheduling in Section IV 


Without loss of generality, we assume that the number of 
seats required in any request is no larger than the capacity of 
any vehicle, i.e.. 


qr < Qk^r GTZ,k G IC- 


( 1 ) 


*The maximum remaining operation time of k can be converted from its 
corresponding remaining fuel level. 
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Fig. 1. Operating intervals in the system. 
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Fig. 2. Operation flow of the system. 


We can always split those requests violating Q into multiple 
requests so that this condition always holds. 

B. Operations 

The system is managed and operated by a control center 
whose main duties are to collect all the required information 
and assign the AVs to serve the transportation requests. The 
system operates in a hxed time interval basis and each time 
interval is divided into data collection and duty assignment 
sub-intervals (see Fig. [^. In each interval, the control center 
hrst collects transportation requests and vehicle statuses in 
the data collection sub-interval. Then the AVs are assigned to 
serve the transportation requests in the duty assignment sub¬ 
interval. On one hand, the duration of each interval should 
be long enough such that the communication delays will not 
result in any data missing from the customers and vehicles 
for scheduling. On the other hand, it should be short enough 
such that the collected data can reflect the current situation 
happening in that interval. In practice, the data collection sub¬ 
interval is longer than the duty assignment one. The former 
may last for a few minutes while the latter may takes a few 
seconds. 

Fig. 1^ illustrates the operation flow of the system with re¬ 
spect to an operating interval. As powered by various wireless 
vehicular communication technologies, all AVs are connected 
and can communicate with the control center instantaneously. 
In this way, the control center can collect the necessary 
vehicle statuses, e.g., current locations of AVs, confirmation 
of serving requests, traffic congestion information, etc., in the 
data collection sub-interval. Customers can also submit their 
requests to the control center by any appropriate means, e.g.. 


phone calls, mobile apps, etc. After the data collection sub¬ 
interval, all the data required to perform duty assignment are 
ready at the control center. 

In the duty assignment sub-interval, the control center 
processes the collected data and computes the duty assignment. 
There may exist some unattended requests incurred from some 
previous intervals because of their unsuitability in the previous 
system conditions. They are merged with the newly submitted 
requests and then all these requests are considered en masse. 
The duty assignment further consists of two processes: admis¬ 
sion control and scheduling. Admission control checks all the 
outstanding requests and determines which requests are going 
to be admitted in the current interval. The unadmitted requests 
will be reserved for consideration in the next interval again. 
Any invalid or inappropriate requests are also permanently 
excluded in the admission control process. We compute the 
travel schedules of the AVs to serve the admitted requests in 
the scheduling process. If a vehicle is assigned with a request, 
its schedule settled by the control center needs to satisfy the 
following requirements: 

1) Complete route specification: Since the vehicle is un¬ 
manned, we need to specify the exact route so that the vehicle 
can follow the route to pick the passengers of the assigned 
requests up and to drop them off at the required destinations. 
Moreover, the route should be short enough so that it has 
sufficient fuel to complete the route. The vehicle should end 
up at a refuel station to avoid breaking down in the middle of 
any road segments. This can guarantee that the vehicle must 
be able to refuel after completing all the assigned services. 

2 ) Time constraints: The vehicle should be able to pick the 
passengers up at a time within the service starting time window 
specified in the request. Moreover, the actual ride time should 
be no longer than the maximum value stated in the request. 

3) Capacity constraints: When the vehicle arrives at the 
pickup location, there should always be enough free seats 
available to accommodate all the passengers of the request. 

Admission control and scheduling are inter-related and we 
will discuss their details in the subsequent sections. After 
determining the result, the control center then distributes the 
assignments to the corresponding AVs, which provide services 
to the customers. 


IV. Scheduling 

Scheduling involves determining the following: 

• the assignment of AVs to the requests; 

• the routes of AVs to accomplish the assigned requests; 
and 

• the times by which the AVs should reach particular 
locations. 

Here we assume that all requests being scheduled are ad¬ 
missible, where the admittability of a request is handled by 
admission control. Thus all requests will be served by appro¬ 
priate vehicles after scheduling. When discussing admission 
control in Section |V] we will explain the relationship between 
admission control and scheduling. 

To facilitate scheduling, we assume that all vehicles are 
connected and can communicate with the control center with 
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reasonably short delays. This ensures that no apparent changes 
in positions happen to the AVs in each interval given in 
Fig. B With the support of modern advanced communication 
technologies, this assumption can go through. In our model, 
we require that the computation of scheduling can be done in a 
short period of time. This ensures the validity of the traffic data 
when the vehicles traverse along their assigned routes. There 
are basically two types of traffic data; the distances and travel 
times of road segments. The former is time-invariant while 
the latter usually changes gradually. In other words, signihcant 
changes in travel times only take place in a timespan much 
longer than the time interval. 

A. Preprocessing 

We schedule the AVs to accomplish the transportation 
requests to achieve the minimum total operational cost in 
terms of fuel costs, which are in turn measured by the total 
distance traveled. The distance between any pair of locations 
is invariant and we transform G{V^£) to G'{V\£') with any 
shortest path algorithm, e.g., Dijkstra’s algorithm ll28l . where 
V' C V is the set of locations at which we need to determine 
the arrival times of the assigned AVs in order to conhgure 
their travel schedules. V' includes the first locations visited 
by all the vehicles (i.e., a^’s), the sources and destinations 
of the requests (i.e., s^’s and d^’s, respectively), and the 
locations of the refuel stations (i.e., i G V). £' is dehned 
as {{i,j)\i,j G V'} such that there exists a shortest path from 
i € V to j S V in G. For (i,j) G £', the associated and 
tij are the sums of costs and times, respectively, of all the 
edges constituting the corresponding shortest path in G. In 
the subsequent computation, we focus on G'{y',£') instead 
of G{V,£). The reasons why we adopt this transformation 
are two-fold; First, the number of variables needed in the 
formulation can be dramatically reduced. The set V\V' are not 
important as all conditions conhning to the locations specified 
by the vehicles and requests are restricted to V' only. In this 
way, the efficiency of solving the scheduling problem can be 
improved signihcantly. Second, this can improve the flexibility 
of the schedules. Consider that AV k goes from vertices 1 
to 4 and there exist two paths connecting them as. Path 1; 
1 —> 2 —4, and Path 2; 1 —3 —4. Suppose that vertices 1 
and 4 belong to V but vertices 2 and 3 do not. To satisfy the 
requirements imposed on k, we need to determine the times by 
which k should arrive at vertices 1 and 4 only, i.e., and If 
vertices 2 and 3 are also included in the formulation and Path 1 
is finally chosen, will be specihed by solving the scheduling 
problem and thus k needs to arrive at the vertices by t\, t\, and 
t\, respectively. If not, only and t\ are specified and we can 
give flexibility to k of arriving at vertex 2. can be any time 
between and as long as the required travel times spent on 
(1,2) and (2,4) have been considered. This flexibility gives 
room for k to respond to any instantaneous traffic incidents 
which may disturb its original travel plan. This also allows k 
to change to Path 2, if needed, without altering the original 
travel plan. 

Note that the preprocessing step can be skipped if the 
scheduling problem constructed directly from G{V,£) can be 


solved efficiently. However, if the preprocessing is required 
to simplifly the scheduling problem, it can be considered as 
a number of result lookups. As Cij’s generally refer to the 
travel distances which are invariant, the results of the shortest 
path computations are also invariant. In fact, before the system 
operates, we can first compute the shortest path for every pair 
of locations in V. When the preprocessing is triggered in an 
interval, we just need to look up the pre-computed shortest 
path results. Hence, the time cost of preprocessing can be 
considered negligibly small. 


B. Problem Formulation 


We formulate the scheduling problem based on G'(V',£'). 
The given data for the problem parameters include the graph 
G'{V'^£') with costs c^’s and travel times fij’s, the set of 
transportation requests TZ, and the set of AVs K.. We define 
several variables for the problem. Binary variables x^^ ’s are 
used to indicate which connections will be traversed by the 
vehicles, as 


1 if vehicle k traverses 
0 otherwise. 


We define binary variables y^’s for the assignment of the 
vehicles to the requests, as 


r 1 if vehicle k is assigned to request r, 
( 0 otherwise. 


For i G V, binary variables pf’s are utilized to indicate the 
refuel stations at which the vehicles end their routes, as 

fe _ J 1 if vehicle k ends its route at vertex i gV, 

(0 otherwise. 


We need to specify the times and occupancy conditions at 
various locations along the routes. Let be the time by which 
k should arrive at vertex i and /* be the number of passengers 
in k right before it leaves i. 

We aim to construct economical schedules for the AVs and 
thus we minimize the total operational cost with the objective 
function as 


We define a set of constraints to confine the scope of the 
variables so that the requirements discussed in Section |III-B| 
are satisfied. Each transportation request can only be served 
once and thus we have 


^t/," = l,Vre7^. (3) 

fcG/C 

Each AV will end at one of the refuel stations if it is assigned 
to a request. This is specified by 

(4) 

iev 

If AV k is not assigned to any request, we do not need to 
determine a path for k so as the final stopping refuel station 
for k. Thus it is possible to have 9i ~ ^ some k. 
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Let JV^{i) and JV~ (i) be the sets of incoming and outgoing 
neighbors of vertex i, i.e., = {j G V'\{j,i) G £'} 

and = {j G V'|(i,j) G £'\. We model a path with a 

network flow model. A path starting at au and ending at i G V 
can be dehned with the following: 


0 < 

E - 

V x^ 

<Y,yrykGic, 

(5) 

i&M- (afc) 

i&M+{ak) 

r 


0 < 

M 

1 

E 4^ 

pf,ViG V,fcG/C, 

(6) 






E 

H 

II 

M 

x’(j,ytGV' 

\ V U {ak\k G 1C}. 

(7) 




Eq. ^ dehnes for the starting vertex of k, where a starting 
vertex has one unit of net outgoing flow. Vr specihes if a 
path needs to be dehned for k. If there are no requests assigned 
to k, Vr becomes zero and is not the starting vertex of 
any paths for k. Similarly, (|^ dehnes for the destination vertex 
of k and the exact vertex i ended by k is indicated by gf. If k 
ends at i G V, (|^ will allow i to have one unit of net incoming 
How for k. For other vertices, Q sets the conversation of how 
by equalizing the corresponding incoming and outgoing hows. 

If request r is assigned to vehicle k, k needs to pass through 
the pickup location Sr of r. It is equivalent to having positive 
outgoing how for k at Sr as 

^ xl,>y^yrGn,kGJC. ( 8 ) 

iG7tA-(sr) 

Similarly, k needs to pass through the dropoff point dr of 
request r when r is served by k. This requires positive 
incoming How for k at dr as 

E >y^VrG7^,fcG/C. (9) 

iGJG+{dr) 

Note that specifying incoming How for Sr is not sufficient as it 
is possible to have zero incoming how when k begins its path 
at Sr exactly. Similarly, it is not sufficient to specify outgoing 
How for dr as it is possible to have zero outgoing How when 
k ends its path at dr- 

No matter where vehicle k goes, it cannot travel continu¬ 
ously longer than its operational time limit specihed by T^.. 
Moreover, it needs to take at least in order to reach the 
initial vertex of its path. Hence we have 

tl<t^ <fk,yi GV',k G 1C. (10) 

Let M be a sufficiently large positive number. When vehicle 
k traverses edge the time at j should be larger than or 

equal to the time at i together with the travel time on (i,j), 
i.e., tij. This can be specihed by 

tj >t^ + Uj — M(1 — x^j),yk G IC,i,j G V'. (11) 

When vehicle k is assigned to request r, the actual ride time 
to reach dr from Sr should be no larger than the maximum 
ride time Tr specihed by r, i.e., 

<Tr + M{l-y^)yr GTZ,kGlC. (12) 


If request r is served by vehicle k, k should arrive at Sr 
within the service starting time window [e^, Ir] specihed by r. 
This can be expressed as 

e, - M(1 - y^) <tl <lr + M(1 - Vr GU^kGlC. 

(13) 

Passengers being served occupy seats and the capacity limits 
of all vehicles should be satished at all times. So we have 

0</f <Qfe,VzG V',fcG/C. (14) 

At ttfe, some passengers induced from TZk may get off k 
and new passengers may get on k from other requests. The 
occupancy conditions of the AVs at their initial vertices a^’s 
are given by 

E ^-yr- E dryt^kGJC. (15) 

r\sr=ak r\dr=ak 

When k traverses from i to j along {i,j), vertex j may be 
the pickup locations of some requests and dropoff locations of 
some other requests. The relationship between the occupancy 
conditions of AV k at i and j can be specihed as 

fk ^ ^ Qrv’;, (16) 

r\sr=ak r\dr=ak 

yi,j GV\k G 1C. 

When an AV reaches a refuel station, all requests assigned 
to it should have been settled and no passenger should be 
accompanied to the end of the route. This is described by 

/f <M(l-5f),VzGL,fcG^. (17) 

Recall that there are two kinds of requests which have 
already been assigned to the AVs before the current scheduling 
interval, i.e., TZk = TZk U TZk. As a (nearly) real-time appli¬ 
cation, with updated information, we may further improve the 
system performance by revising the already assigned requests. 
For those requests currently being served, e.g., r G TZk with 
the passengers sitting in k, we can consider those r’s as 
“new” requests starting the service at the the starting node 
Ofc by setting Sr = ak and affirming y^ = 1. As fc has been 
serving r by following a previously determined schedule, we 
can update its Tr by shortening the elapsed time. The service 
starting time window is no longer important and thus we set 
Cr = —oo and Ir — -boo. There is no change to For 
those requests TZk’& which have been previously assigned to 
k but not yet been served, we may reschedule r G TZk with 
other AVs if it can result in lower cost. As the passengers do 
not concern about which vehicle would eventually provide the 
service, it may be more efficient to re-allocate those r’s in 
TZk to other more appropriate vehicles with lower operational 
cost. This enhances the Hexibility of the system. As a whole, 
the scheduling problem is dehned as 

Problem 1 (Scheduling): 

minimize <13 

subject to 

over xfj G {0,1}, y* G {0,1}, pf G {0,1}, G M+, 
/{= G Z+,Vz,j G V',( G V,r G 7^, fc G /C. 
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Problem [T] has a linear objective function and linear equality 
and inequality constraints. Some of its variables are binary 
while the rest are real. Thus the scheduling problem is a 
mixed-integer linear program (MILP). Although the prepro¬ 
cessing step discussed in Section IV-A helps simplify the 
problem, the numbers of variables and constraints also grow 
with the sizes of TZ and K.. As those invalid requests have been 
removed by admission control (discussed in Section [V] l, this 
MILP is always feasible and all requests must be served. As 
long as all c^’s are positive, the solution of Problem 1 does 
not result in zero cost and the schedule without serving any 
requests will never be a solution. 


C. Complete Schedule Construction 

Since the vehicles are unmanned, we need to provide 
complete instructions about the paths and schedules so that 
they know when and where they should go in order to 
provide services to the customers. Solving the MILP gives the 
solutions for xf^-’s, y^’s, bCs , and f^’s. As being binary 
variables, the results of tcj^’s and y^’s are unambiguous. The 
latter tells which vehicles are assigned to the requests. The 
former explains the route of each k in G' starting at Ofc and 
ending at one of the refuel stations. The paths determined 
in G' in turn infer the corresponding complete routes in G. 
Recall that we have determined the shortest path from i to 
j in G corresponding to the edge {i,j) G £'. By inserting 
the shortest paths for every pair of adjacent vertices along the 
paths based on G", the complete routes in G can be derived 
accordingly. 

Note that ([T0li-([T3]l define the scope of fj’s in the form 
of inequality. The resulting f^’s make feasible time schedules 
but may not be specific enough leading to ambiguity. For 
example, if the arrival of k at location i at any moment in 
is feasible, a reasonable way is to set = t' and this 
enhances the flexibility for the later scheduling intervals. To 
construct the schedule of k, we examine the path computed 
from For the first vertex, we set For any 

subsequent vertices, says from i to j, we can add the travel 
time on edge {i,j) to the settled time at i to obtain the settled 
time at j, i.e., tj = +tij. If vertex j induces a request, we 
need to fulfill its service starting time window and thus we 
have tj = max{ff + tij,er}. 

Similarly, ([T4l)-([g also confine the occupancies of the 
vehicles at various locations with inequalities. The exact seat 
conditions cannot be told from the resulting /f’s. Usually, we 
only concern about the seat conditions at the customer pickup 
and dropoff points, i.e., sC& and dCs- We can examine the 
route computed from ’s again and determine the occupancy 
conditions. For example, k goes from i to j on If j is 

the service starting location of request r, we add the number 
of seats required for r to the occupancy of fc at i to get its 
occupancy at j, i.e., fj = +qr- If j is a service destination 
location instead, we subtract the seats taken by r from the 
occupancy of fc at i to get its occupancy at j, as fj = ft — Qr- 
In this way, the complete schedules of the vehicles with duty 
assigned can be determined and the vehicles just need to follow 
the schedules to accomplish the services. 


V. Admission Control 


Recall that, in Section IV all requests submitted for 


scheduling are assumed to be admissible and need to be served. 
In this section, we investigate the admission control problem. 
We first formulate the problem and then study the variations in 
the presence of traffic congestion and no-show of passengers. 


A. Problem Formulation 

Admission control is responsible for determining a set of 
requests suitable for scheduling. In other words, after admis¬ 
sion control, we will produce a subset TZ CTZ for subsequent 
scheduling, where TZ is the set of all available requests and TZ 
will be settled by appropriate AVs in scheduling. However, to 
judge if a particular request r is admissible, we need to check 
not only its feasibility but also its profitability, i.e., whether 
serving r will induce a positive net profit. Determining the 
net profit from r involves its induced cost, which is regulated 
through scheduling. Hence there is no clear precedence rela¬ 
tionship between scheduling and admission control and these 
two processes should be considered simultaneously. 

We can interpret the requests and AVs as the demand and 
supply of transportation services, respectively, and then the 
constraints of Problem [T] define the scope of matching between 
the demand and supply. The constraints can be satisfied more 
easily with larger 1C and smaller TZ. Practically, the size of JC 
is generally fixed as the system would not suddenly employ 
more AVs into the fleet or many AVs become out of service all 
of a sudden. However, the requests submitted are absolutely 
external from the system; the system can neither forbid the 
customers from submitting requests nor modify the attributes 
in the requests to match the conditions of AVs. In fact, just 
a single inappropriate request (e.g., a request with very short 
tolerable ride time) can make Problem [T] infeasible and the 
scheduling collapse. To avoid this, the system should perform 
admission control by screening out any inappropriate requests 
before undergoing the scheduling (see Fig. [^. Consider that 
entertaining a request results in revenue. Although the sys¬ 
tem cannot modify the submitted requests, it has the right 
to dismissing any requests by sacrificing the corresponding 
revenue. Admission control manipulates TZ with the following 
objectives: 1) Produce a subset of requests C 7?. so that the 
scheduling process can be performed, i.e.. Problem is made 
feasible with TZ', 2) Maximize the profit incurred. 

Consider that we admit iz for scheduling with Problem 
which can be re-written as 


minimize (/>(a) (18a) 

subject to a G Z{TZ), (18b) 

where a = {xf^-} U {y^} U {gj} U {tf} U {ft}, = 

J2i jev ths feasible region of 

Problem M with respect to TZ. Let be the revenue made 
when admitting r G TZ and define 

_ r 1 if we admit r G TZ for scheduling, 

[0 otherwise. 

We also define the admission function a-{TZ, [zrjren) which 
returns TZ C TZ based on Zr such that r G if Zr = 1. The 




total profit is the difference between the total revenue and total 
cost, i.e., PrZr—<t>{oi). Then we formulate the admission 

control problem as 

Problem 2 (Admission Control): 


maximize 

^{TZ, [Zr\r(^n) = X! ~ 4>{o:) 

(19a) 


rGTZ 


subject to 

TZ = a{TZ, [zr]ren), 

(19b) 


Zr = 1, Vr G TZk, k G TC, 

(19c) 


a G argminj^ja) : a G Z{TZ)}, 

(19d) 

over 

o., TZ G TZ, Zr G '[0,l},Vr G TZ , 

(19e) 


where ( |19c| l ensures that those requests admitted in the previ¬ 
ous operating intervals will still be admitted in the current 
interval. We cast admission control as a bilevel optimiza¬ 
tion problem, which consists of an upper- and a lower-level 
optimization. $ is the upper-level objective function with 
upper-level variables TZ and z/s. (j) represents the lower-level 
objective function with lower-level variable a. Eq. ( |19d| i is 
in fact ( fTSl l, and thus, we cast Problem as a constraint 
of Problem The upper-level optimization is to manipulate 
the whole set of requests TZ and determine TZ such that TZ 
can maximize the total proht. The lower-level optimization is 
to schedule the AVs to serve the set of admissable requests 
TZ so that the retained cost is the lowest. The two levels 
of optimization are inter-related; the upper level requires the 
result of the lower level, i.e., a, in order to get TZ, while the 
lower level needs the result from the upper level, i.e., TZ, in 
order to output a. Note that if the upper level produces TZ 
which makes Z infeasible, the resulting a will return -|-oo for 
the objective function of ( |19d| i, which will in turn make the 
objective function ( |19a| l retain — oo. 

Bilevel optimization is in general difficult to solve. A bilevel 
problem with a linear objective function and linear constraints 
is NP-hard 1291 . As seen from we are manipulating 
discrete variables in the problem. As classical methods for 
bilevel optimization usually assume smoothness or convexity 
1^ . those classical methods are not applicable to Problem]^ 
As inspired by we decide to tackle the problem with 

an evolutionary heuristic approach. Evolutionary approaches 
are commonly applied to bilevel optimization problems in 
transport science. Eor example, in l33l . Differential Evolution 
(DE) is employed to address the optimal toll problem, which is 
about setting polls to control congestion, and the road network 
design problem, which determines the capacity enhancements 
of network facilities. In ll34l . GA is applied to the transit 
road space priority problem, which optimizes the system by 
reallocating the road space between private car and transit 
modes. We will design a GA-based algorithm to solve Problem 

Before discussing the details of the algorithm, we dehne 
admissibility and give some analytical results for Problem 
which can help design the algorithm in the next section. 

Definition 1 (Admissibility): A set of requests TZ is admis¬ 
sible if [zr]ren produces TZ, which results in hnite proht, i.e., 
^{TZ, [zr]rGn) > -oo. 

Theorem 1: We have the following results for admissibility; 

1) Consider that a subset of requests 7^ C 7?. are admissible. 

Let 'PiJZ) be the power set of iZ. Any TV € 'P(TZ) is also 


admissible. 

2) Eor any singleton {r} C TZ, if {r} is not admissible, any 
superset TZ D {r} are also non-admissible. 

3) Consider subsets of requests, TZi,TZ 2 C TZ, and subsets 
of vehicles lCi,lC 2 C 1C. Suppose ICi D IC 2 = 0. If 7^i 
and 7^2 are admissible by ICi and IC 2 , respectively, then 
TZi U 7^2 are also admissible. 

Proof: Eor Statement 1, Constraint ( |19b| i dehnes TZ, 
which is an input of Constraint ( |19d| ). It is sufficient to show 
that the removal of any r £TZ will not make ( |19d| i infeasible if 
the participating AVs can serve all the requests in TZ. Suppose 
that r is removed from TZ and AV k would have assigned to 
serve r if r had been admitted, k can still follow the path as 
if r is present. Hence ( |19d| l is still feasible for TZ\r. 

Eor Statement 2, a non-admissible r means that it is 
impossible to arrange an AV to entertain r. We will never 
be able to provide services to a set of requests containing r 
as its component r can never be served. 

Eor Statement 3, we can represent T^i U 7^2 by three non¬ 
overlapping sets 7^1 \( 7 ^ 1 ( 17 ^ 2 ), TC 2 \{TZir]TZ 2 ), and TZ 1 CTZ 2 . 
Since ICi and IC 2 are mutually exclusive, TZi \ (TZi (17^2) and 
TC 2 \ irZi n TZ 2 ) can be served by JCi and IC 2 simultaneously. 
Each r G TZi n TZ 2 can be admitted by either /c G or 
fc G /C 2 . ■ 

Lemma 1: The system will not make negative proht. That 
is, for any TZ, Problem must have at least one feasible 
solution whose objective function value is non-negative. 

Proof: We separate TZ into the previously admitted and 
newly received requests, i.e., {TZk} and TZ \ {TZk}. 

Eor the newly received requests, we can always set Zr = 
0,\/r gTZ \ {TZk}. Then ( |19b| ) gives TZ — %. Eq. ( |19d| l returns 
a with 4>{a) = 0 as no requests need to be served and thus 
no AVs have been used to provide service. Hence we have 

SrG7?,\{7?,fc} Pr^r — 4>{cy) = 0. 

Eor the previously admitted requests, since they are admitted 
in some previous admission control processes, they must incur 
non-negative proht when they were admitted as new requests 
before. Otherwise, we would not have admitted them at the 
hrst place. ■ 

Lemma [T] implies that Problem must be feasible. 

Theorem 2: Consider two subsets of requests TZ and TZ' 
with TZ TV <Z TZ. \f both TZ and TZ' are admissible, then 
TZ' will not be less prohtable than TZ, i.e., sup<i>(7^, {zr\r G 
TZ}) < sup$(7^',{z^|r G TZ'}). 

Proof: Suppose sup <h(7^, C IZ}) > 

sup $(7^', {zr\r G TZ'}). We write TZ' =TZC {TZ' \ TZ). Then 
we have 

sup<i>(7^', G 7^'}) 

= sup$(7^, {zr\r G TZ}) -I- sup<i>(7^' \ TZ, {z^f G TZ' \ TZ}). 

By Lemma sup$(7^' \ TZ,{zr\r G TZ' \ TZ}) has a value 
larger than or equal to zero. This induces a contradiction. ■ 
Theorem implies that entertaining more requests will not 
reduce the amount of proht made. 

B. Variations 

Here we investigate how traffic congestion and no-show 
of paasengers impact on admission control (and scheduling). 
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Basically, we will see that under these circumstances, the 
proposed admission control and scheduling mechansims can 
still be applied but we may need some additional minor 
arrangements to handle various situations. 

1) Traffic Congestion: Traffic congestion has direct impact 
on the travel time tij for some {i,j) G S and subsequently 
affects the admissibility of requests. Recall that the system 
operates in a hxed-interval basis and each interval generally 
lasts for a few minutes (see Section |III-B| i. We basically 
assume that, within an interval, the parameters, including the 
travel times, are constant or with very small changes such that 
the results of admission control completed for that interval 
are still valid. If the travel times are relatively fast changing, 
we need to shorten the duration of the intervals to make 
the assumption valid. On the other hand, if the travel times 
are slowly varying, we may lengthen the durations to reduce 
the computation burden. Hence, the duration of the operating 
intervals depends on the traffic conditions of the deployed 
service area. 

Now consider that tij in the current interval has been 
updated such that its value is different from that used in 
the previous interval. There are three cases for the possible 
influence: (i) tij does not involve in TZk for all /c; (ii) tij 
involves in TZk for some fc; and (iii) tij involves in TZk for 
some k. For Case (i), since tij has not been used to serve any 
requests, its change does not affect the schedules of any AVs. 
Hence, nothing needs to be done solely based on tij. For Case 
(ii), although t^ has been used to determine the schedules of 
some AVs, the involved requests have not been served yet. 
We can simply consider these requests as newly submitted 
requests and perform admission control and scheduling with 
them again. For Case (iii), tij affects those schedules which are 
being implemented by some AVs. In the subsequent intervals, 
the scheduling process will see if the road segment {i,j) can 
be avoided by determining other shortest paths. If not, as 
the passengers are being served, it may not be appropriate 
to ask them to shift to other vehicles for their journeys and 
nothing can be done further operationally. However, we may 
compensate the passengers in the marketing perspective, e.g., 
by issuing cash coupons for future rides. 

2) No-show of Passengers: No-show refers to the situation 
that some or all passengers of a paricular request are absent 
at the scheduled pickup time. If a passenger cannot arrive at 
the pickup location on time, this will be considered as no- 
show. If some but not all passengers are absent, the schdule 
of the designated AV is unaffected but fewer seats are required. 
These unused seats can be released to serve other appropriate 
requests in the later intervals. If all passengers are absent, the 
“resources” allocated to the request can be released in the 
subsequent intervals right after its original pickup time. This 
gives the AV more flexibility in time and occupancy to serve 
future requests. In the business perspective, there may exist 
some penalty policies to discourage such activities. 

VI. Genetic-Algorithm-Based Solution Method 

In this section, we propose a solution method to tackle 
Problem We adopt a GA-based framework to structure the 


method. Some of its components are designed based on the 
analytical results discussed in Section 

A. Working Principle of Evolutionary Algorithms 

Evolutionary Algorithms (EAs) refer to a class of opti¬ 
mization algorithms, whose designs are inspired by various 
natural phenomena. Examples include GA ||35l, DE ||36|, and 
Chemical Reaction Optimization (CRO) 121 ■ Different EAs 
generally have similar working principles: An EA samples the 
solution space of the problem iteratively and tries to locate a 
global optimum after examining a limited number of candidate 
solutions in the solution space. In each iteration, with some 
operators, it generates a population of candidate solutions 
based on those obtained from the previous iterations and their 
corresponding objective function values. It tends to converge to 
the global optimum along the iterations and it terminates when 
a stopping criterion is matched. Different EAs have different 
designs of their operators. Eor example, GA is designed based 
on the ideas of natural selection in genetics while CRO mimics 
the nature of chemical reaction processes. Unlike most of the 
traditional optimization approaches, EAs require the problem 
to be neither convex nor differentiable. In each algorithm run, 
they only need to sample a number of candidate solutions and 
evaluate their solution qualities with the objective function. 
Hence, a search with an EA usually incurs many objective 
function calls. As discussed, EAs have been shown effective 
in solving bilevel optimization problems in transport science. 
We are going to adopt the well-established GA framework 
to facilitate the design of a method which can return good 
solutions for Problem in a practical sense. 

B. Distributed Scheduling 

When an EA is employed to address Problem many 
candidate solutions will be generated. To evaluate the quality 
of a particular candidate solution, we need to compute ( |19a| ) 
once, which also needs to examine one time. In other 

words, a single run of EA requires to solve Problem [T] 
many times. When the lower-level optimization is simple, 
the computational burden of solving it many times may still 
be acceptable. However, this is not the case for Problem 
where the required numbers of variables and constraints grow 
exponentially with the quantities of transportation requests and 
serving AVs. This implies that we need to a more effective way 
to solve Problem [T] in order to tackle Problem 

Consider that TZk C 7?. is the subset of requests assigned 
to vehicle k. Suppose that we know the distribution of the 
requests to the vehicles, i.e., TZk for all k. Since each request 
is only served by one vehicle, we have TZk CTZi =0, for any 
k,l G K, k fz I, and [JkeK'^k = 7^- When given TZk, we 
consider the following problem: 

Problem 3 (Scheduling Subproblem for vehicle k): 

maximize E djXij (20a) 

ijGV' 

subject to (20b) 

iGV 
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0 < H - Y. ^ H 

JG7'/-(afc) i&N'+iak) »' 




(20c) 


0 < Y Y G V, 

j&Af+{i) j&Af-{i) 


(20d) 


Y = Y G V' \ V U {Ofc} 

j&Af+{i) j&M-{i) 



(20e) 


(20f) 



Y ^'id,.>yryrG'Rk, 

(20g) 



il < it <fk,yiG V, 

(20h) 

it > it + kj - M{1 - x%),yi,j G V 

(20i) 

ii - it <Tr + M{1 - yt),yr G TZk, 

(20j) 

Cr - M(1 - yt) < it <lr + Mil - yt 

'),Vr G 


(20k) 

0<ft<Qk,yiCV', 

(201) 

ft> Y Y <l^yt 

(20m) 

r\sr=ak r|dr=afc 


fYft- Mil - 4) + y] qrt 

r\sr-=ak,r^'lZk 


Y grvtyhjci^', 

(20n) 

r\dr.=ak,r^TZk 


ft < Mil-gt), ytGV, 

(20o) 

xt G {0,1}, t C {0,1},gt C {0,1},it 

G M+, 

ftGZ+,yi,jGV,lGV,rGTZk. 

(20p) 


Solving Problem only allows us to obtain the serving path, 
the schedule to reach various locations along the path, and 
the capacity conditions of vehicle k for serving the requests 
indicated by itk- Problem looks similar to Problem |T| but 
indeed much simpler. It does not contain ([^ and it manipulates 
fewer variables as those related to vehicles other than k are 
not included. It also possesses fewer constraints because of 
fewer variables. 

For simplicity, similar to ( fTS] !, we also write the solution, 
objective function, and the solution space of Problemas ak, 
(j)k{ctk) and Zk, respectively. 

Theorem 3: When given TZk <Z'R-,\/k ^ K, such that TZk H 
TZi = 0, for any k,l G 1C, k ^ I, and TZk = TC, solving 
Problem for all fc G /C is equivalent to solving Problem [T] 
i.e.. 


inf ip{a) = Y, inf 4>k{ak), 


Ol^Z 


and = x^Vr = fr, gf = gf, and /f = /f, 

Vi,j gV ,l GV,r G Tik,k G K. 

Proof: When given such TZk C TZ,yk G 1C, we can 
construct y^,yr G TZ,k G 1C, such that 0 holds. In this 
way, we can remove 0 from Problem [T] Without Constraint 
0 . the objective function and the rest of the constraints of 


Problem [T] become separable in terms of k: (|^ gives the sum 
of costs spent on the vehicles; Eqs. 0-0 specify the paths 
traversed by the vehicles, each of which are independent; Eqs. 
([^-([^ conhne the time requirements at various locations 
along the vehicular paths; Eqs. ([T4li-([g limit the passenger 
capacity conditions along the vehicular paths. If we group the 
terms of 0 and the constraints 0-® for each k, we will 
have \1C\ problems, each of which is given by ■ 

Theorem states that when the assignment of requests to 
the vehicles is known, solving the \IC\ individual schedul¬ 
ing subproblems distributedly can retain the solution of the 
original scheduling problem. Note that this result is dedicat- 
edly developed based on some characteristics of the problem 
formulations and it generally cannot be applied to the other 
scheduling problems. Unlike general distributed optimization 
EH, ll39l . our result here does not require techniques like 
message-passing. As a result, the |/C| subproblems can be 
solved by |/C| computing units distributedly. Assuming that 
the vehicles are connected through advanced vehicular com- 
imunication technologies at all times, an obvious option of 
the computing unit is the AV. Thus, by Theorem if we 
can assign each vehicle with the requests it needs to serve, 
each vehicle can determine a feasible path per se to serve 
the assigned requests with the lowest cost by solving Problem 
[^concurrently. However, when the communications between 
a particular AV and the control center are interrupted, the 
corresponding subproblem can be delegated to an unoccupied 
computing unit at the control center or even to the cloud 
instead. The computed scheduling result can be returned to 
the AV when its communications have been resumed. 

C. Algorithmic Components 

Since GA is one of the most popular EAs, we adopt a 
GA-based design to address the admission control problem. 
GA generates a sequence of candidate solutions using opera¬ 
tions inspired by natural evolution, e.g., inheritance, selection, 
crossover, and mutation. Here we introduce various algo¬ 
rithmic components before discussing the overall algorithmic 
design: 

1) Chromosome: A chromosome specihes a candidate so¬ 
lution of Problem [^ While the lower-level optimization is 
handled by a standard MILP method, our GA approach 
is mainly used to handle the upper-level optimization. A 
chromosome is represented by a 1 x |7?.| binary vector 
z = [zi,..., Zr,..., Z| 7 ?,|], together with a vehicle assignment 
vector K = [ki, ..., Kr,..., K| 7 ^|], where represents the 
vehicle assigned to r if Zr is of unity. Note that the introduction 
of K is the trick to carry out distributed scheduling discussed 
in Section VI-B Although k can be determined in ( |19d| ) if we 
apply the original formulation of scheduling ( [T8] l, there is no 
harm in manipulating k together with z in the chromosome 
level. This makes distributed scheduling feasible and the 
beneht of computation time saving will be clear in Section 
VII-A| During the course of search, we maintain a population 


of Npop chromosomes. 

2) Fitness Evaluation: We evaluate the htness of each 
chromosome in a distributed manner. The htness evaluation 
process is illustrated in Pig. [^ and it consists of hve steps: 
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(1) Grouping requests in TZk'- Each chromosome i contains 
z* and At®. For those r’s with z® = 1, based on k®, at 
the control center, we can divide TZ into \JC\ groups, i.e., 
TZk.yk G K. 

(2) Request information distribution: For each k, the control 
center transmits TZk to AV k, e.g., via VANET. 

(3) Distributed scheduling: Modern vehicles are generally 
equipped with computers and thus each k can solve the 
individual Problem simultaneously with other vehicles. 
Those AVs with empty TZk assigned can skip the com¬ 
putation. 

(4) Individual cost return: The individual vehicles transmit 
the computed costs of scheduling to the control center, 
e.g., via VANET. 

(5) Fitness computation: Based on Theorem the cost 

associated to the chromosome is the sum of the ob¬ 
jective function values of Problem determined by the 
individual AVs, i.e., = k). 

Then the fitness of the chromosome can be computed 
as 4>(z, k) = PrZr ^^kGlC 't'k{c>ik\Kr — 

Note that $(z, k) becomes —oo if and only if any request r 
with z^ = 1 is non-admissible. The advantages of undergoing 
the above process are three-fold: 

(i) The computation time can be dramatically reduced. 
Among all the computation components in the algorithm, 
scheduling is the most computationally demanding. If 
each vehicle can compute their own schedules, all the 
individual scheduling subproblems can be solved simul¬ 
taneously. 

(ii) All entities need to manage the necessary data only, p^ 
is the result of the deal between the customer and the 
control center. With distributed scheduling, the usage of 
Pr is restricted to the control center and no vehicles are 
involved. Moreover, after a vehicle solves its scheduling 
subproblem, its computed schedule is stored in that 
vehicle only, but not the control center nor any other 
vehicles. 

(iii) The amount of communications keeps minimal. In each 
evaluation, the only data needed to be communicated 
between the control center and the vehicles are the re¬ 
quests assigned to the individual vehicles (in Step 2) and 
the computed scheduling costs (in Step 4). The system 
does not require a sophisticated communication system 
to satisfy the communication requirements. 

3) Tabu List: We construct a tabu list r^. for each request 
r to reduce the size of the search space, contains those 
vehicles k which cannot serve r. As implied by Theorem[T| if a 
request r is not admissible by k, any set of requests containing 
r will also not be admissible by k. In other words, we will 
never need to consider those k in when configuring Kr- 
Unlike Tabu Search ioi, we do not need to update the tabu 


^By abuse of notation, we write $(2, ft) = [zr]) to emphasize the 

structure of the chromosome. 


Fig. 3. 



Fitness evaluation process. 


lists during the course of search]^ t^’s are only constructed in 
the initialization phase of the algorithm and utilized in both 
initial population generation and mutation. 

4) Selection: In each generation, a fraction X^ate of -^pop 
survives and the rest of (1 — Xrate) will be replaced by the 
children bled in the processes of crossover. We apply weighted 
random pairing ED to select the survived chromosomes to 
perform crossover. 

5) Crossover: Crossover is an operator in GA to achieve 
intensification. In each operation, it manipulates two parent 
chromosomes to breed two offspring. The offspring inherit 
the merits from their parents and thus they tend to have better 
fitness values, i.e., higher objective function values of ( |19a| l. 
By Theorem]^ a larger set of requests will improve the fitness. 
Also based on Statement 3 of Theorem [T] we manipulate 
the chromosomes with crossover as follows. Parents i and 
j reproduce offspring i' and j'. i' admits all those r’s as i 
does with the same set of vehicles. If there is any k which 
is adopted in j but not in i, we randomly adopt one such k 
in i on those r’s which are not admitted in its parent i. We 
produce an offspring j' dominantly inherited by the parent j 
similarly. In this way, the offspring are likely to admit more 
requests resulting in higher fitness. 

6) Mutation: Mutation exhibits diversification to prevent 
the algorithm from getting stuck in local optimums and we 
basically follow ED to design mutation. We control the 
amount of mutation with a mutation rate G [0,1]. We 
apply elitism to the chromosome with highest fitness in the 
population and only the rest undergo mutation. A mutation 
occurs on bit of chromosome i and the number of mutations 


^As discussed in Section 


III-B 


admission control is completed in the duty 


assignment sub-interval once m each operating interval. Such sub-interval is 
short so that it is unlikely to have great changes to the positions of the AVs. 
Thus the tabu lists can be assumed to be static throughout the admission 
control process happened in each interval. However, the tabu lists may need 
to be updated in the next interval as the vehicles may have moved to other 
positions. 
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Fig. 4. Flow chart of the algorithm. 


taken place in each generation is p x {Npop ~ 1) x |^|- If we 
perform mutation on z®, we toggle z®. If z® is changed from 
0 to 1, we randomly assign Kr a k which is not in the tabu 
list Tr- If z® is changed from 1 to 0, we set Kr — 0. To 
further enhance diversification, besides the elite chromosome, 
each chromosome has a probability of 7 to be replaced by a 
random chromosome. 


D. Algorithmic Design 

We basically follow ED to design the algorithm, which 
consists of three stages: initialization, iterations, and the final 
stage. The flow chart of the algorithm is given in Fig. We 
maintain the chromosomes with feasible candidate solutions 
during the whole course of search. 

1) Initialization: In initiation, we define all the system 
parameters, e.g., Np^p and Xrate^ construct the tabu 
list Tr for each r. Then we create the initial population of 
chromosomes, each of which is assigned with one random 
request r associated with a vehicle not in its tabu list Tr. This 
can ensure all chromosomes are initially feasible. We evaluate 
the fitness of the initial chromosomes before the iterations 
start. 

2) Iterations: In each iteration (or called generation), we 
manipulate the candidate solutions held by the chromosomes. 
Before any modification, we back up the feasible candidate 
solutions stemmed from the previous generation. Then we 
perform selection, crossover, and mutation to manipulate the 
chromosomes, followed by fitness evaluations. If any chromo¬ 
some possesses an infeasible solution, we retain its original 
feasible one from the backup. We check the stopping criteria to 
see if we continue with the next iteration or proceed to the final 
stage. One commonly used stopping criterion is termination 
after undergoing a certain number of generations. 

3) Final Stage: We output the best solution found in this 
stage. 

In general, the solution method is implemented in a central 
manner at the control center. When evaluating the fitness of 


the chromosomes, the scheduling tasks are distributed to the 
vehicles based on distributed scheduling. 

VII. Performance Evaluation 

We perform a series of simulations to evaluate different 
aspects of the algorithm. We consider a set of real taxi service 
data from sa, containing the pickup and dropoff times, and 
pickup and dropoff locations of a number of taxi trips served 
in the City of Boston. We sample 100 trip data whose pickup 
times happened within a period of 30 minutes in a day of 
2012 as the transportation request pool. Since no existing 
transport can offer flexible shared-ride services as our system 
does, we adopt the data for our system as follows: the earliest 
service starting time as the pickup time of the data, the latest 
service starting time as the pickup time plus 15 minutes, the 
maximum ride time as the actual trip time times 1.5, random 
seat occupancy in the range of [1,5], and 50% of the actual 
taxi fare as the charges. The driving distance and travel time 
between any two locations are determined through the Google 
Maps API. Based on 1431 . we assume that the fuel cost is 
16 cents per mile. We select five gas stations in Boston as 
the refuel stations for AVs. Each vehicle is assumed to be 
equipped with five seats and we randomly place the vehicles 
in the city. 

We perform the simulations on a computer with Intel Core 
17-2600 CPU at 3.40 GHz and 32 GB of RAM. They are 
conducted in the MATLAB environment, where the scheduling 
problem is addressed with YALMIP ll44l and CPLEX ll45l . 
We follow ED to set the GA parameters: Npop = 16, 
Xrate = 0.5, and p = 0.15, and we set 7 = 0.5. Recall 
that, to operate the system for a period of time, we need 
to do admission control for each operating interval within 
the period. To perform admission control for an interval, we 
need to undergo a number of scheduling processes. We try to 
evaluate the performance of the algorithm incrementally from 
the smallest module. Eirst we evaluate the computation time 
for scheduling. In the second test, we evaluate the performance 
of the algorithm on solving the admission control problem. At 
last, we examine the profits made when the system operates 
continuously for a period of time. 

A. Computation Time for Scheduling 

As Problem 1 is an MILP, we assume that CPLEX can 
return the optimal solution if the problem is tractable. So we 
focus on the computation time. When we look at Problem 
1, the numbers of variables and constraints grow exponen¬ 
tially with the problem size in terms of the quantities of 
transportation requests and vehicles. Hence the computation 
time for scheduling grows very fast with the problem size. Eor 
demonstrative purposes, we focus on small problem instances. 
We randomly generate 9 cases from the Boston dataset: three 
cases with three requests, three with four requests, and three 
with five requests. All the cases are served with five vehicles. 
Recall that we have two main ways to address the scheduling 
problem: (1) by solving Problem 1 as a whole and (2) by 
solving a number of Problem 3 collectively. Eor the latter, 
we can further arrange the subproblems to be solved (2.1) 
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Fig. 5. Data processing, communications, and computation of the three 
approaches in scheduling. 


en masse at the control center or (2.2) separately at the 
individual vehicles. Thus, there are three approaches in total 
and we call (1), (2.1), and (2.2) the centralized, cumulative, 
and distributed approaches, respectively. The data processing, 
communications, and computation of the three approaches 
are depicted in Fig. For the centralized and cumulative 
approaches, all data need to be collected and gathered at the 
control center from the passengers and vehicles for processing. 
After scheduling, the computed schedules will be distributed 
to the corresponding vehicles. For the distributed approach, 
the vehicular data are only maintained at the problem solv¬ 
ing agents, i.e., that vehicles per se, before and after the 
corresponding subproblems being solved. After scheduling, 
the resulting costs are transmitted back to the control center 
for the subsequent scheduling. When different numbers of 
vehicles are involved, the computation time can be noticeably 
different. To see this, for each of Cases I-IX, we examine all 
possible combinations of z and k (i.e., candidate solutions 
for chromosomes) and check their computation times for 
scheduling. We consider the time spent on communications 
negligible as it is usually much smaller when compared with 
the computation time. Fig. shows the average computation 
times for feasible schedules with different numbers of vehicles 
involved in each case. Since the computation time of the 
centralized approach grows too fast (e.g., 8.30 s, 69.25 s, 
and 6.72 x 10^ s for 3-5 requests, respectively), the time 
changes for the cumulative and distributed approaches would 
have become indistinguishable if the centralized data had also 
been displayed. For clearer representation, we skip the results 
for the centralized approach in Fig. In Fig. some bars 
are missing because no feasible schedule can be computed 
with particular numbers of vehicles involved. For example, 
one request in Case 111 cannot be scheduled with any vehicle, 
and thus, no results are shown for three vehicles for Case 
III. Generally, for the cumulative approach, the computation 
time grows linearly with the number of vehicles involved 
as more subproblems with similar size need to be solved. 


Sl-cum □l-dist Oll-cum ^ll-dist Slll-cum 


l-dist 



No. of vehicles involved 


(a) 3 requests 


SiV-cum QlV-dist 0V-cum 0V-dist §Vl-cum HVI-dist 



I 


2 3 

No. of vehicles involved 


(b) 4 requests 


SVIl-cum DVIl-dist SVIll-cum ^Vlll-dist SIX-cum BlX-dist 



No. of vehicles involved 


(c) 5 requests 

Fig. 6. Computation times for scheduling. 


For the distributed approach, the computation times with 
different vehicle sizes are more or less similar because the 
involved subproblems can be handled at different vehicles 
simultaneously. While the computation time of the centralized 
approach grows exponentially with the number of requests, 
that of the cumulative approach increases at a much slower rate 
and that of the distributed approach is approximately steady. 
Hence, it is not feasible to adopt the centralized approach. If 
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Case I • •• • Case IV Case VII 



No. of vehicles 

Fig. 8. Profits made with different numbers of vehicles. 

the vehicles have sufficient communication and computation 
capabilities, we suggest the distributed approach. Otherwise, 
we can only endorse the cumulative approach for scheduling. 

B. Admission Control in an Operating Interval 

Next we investigate the performance of the algorithm to 
address admission control for an operating interval. Each 
fitness evaluation involves solving the scheduling problem 
once and the computation time for each fitness evaluation is 
dominated by that for scheduling. Moreover, the computation 
time of the algorithm depends on the number of fitness 
evaluations needed. Since the population size is fixed in every 
generation, the run time of the algorithm can be estimated 
from the number of generations taken place and the results 
determined in Section IVII-AI Hence here we focus on the 
solution quality instead. 

We run the algorithm for Cases 1-lX. As we have examined 
all candidate solutions, we can acquire the optimal solutions 
of these cases. We repeat running the algorithm 20 times for 
each case. Fig. shows the average objective function value 
computed during the course of search for 40 generations. As 
absolute values do not help reveal the performance of the 
algorithm, the objective function values are instead normalized 
with the corresponding optimal values to standardize the 
presentation]^ For each data point, we also provide the error 
bars for the maximum and minimum values computed in the 
20 repeats. The performance of the algorithm in each case 
is similar. The algorithm starts with relatively low quality 
solutions and then converges rapidly to the global optimal in 
a few generations. The gap between the error bars diminishes 
after more generations have been taken place and this further 
confirms the convergence of the algorithm. When the problem 
size increases, it takes slightly more generations to have the 
algorithm converged. We can conclude that our algorithm is 
very effective in solving the admission control problem. 

We further investigate the total profits gained for the test 
cases with different AV population sizes. We perform the 
simulations with the same settings and repeat each test 20 
times. Fig. shows the average results with respect to 5, 10, 
15, and 20 vehicles. Since the resultant profit highly depends 

^An optimal solution has the normalized objective function value equal to 
one. 



SCasel ^ Case 2 


100 - 



Interval 

Fig. 10. Cumulative numbers of admitted requests. 

on the parameters of the respective requests and vehicles, 
the total profits gained from different cases are not directly 
comparable. Instead for each case, we show the percentage 
change of profit by normalizing the results with the profit made 
with 5 AVs. Since all cases show similar trends, for clearer 
presentation, we give the results for Cases 1, IV, and VII in 
Fig. only. In general, the more vehicles available, the higher 
profit can be made. However, the increase of profit is marginal; 
when compared with 5 AVs, the increase is just 1 — 2% in the 
presence of 20 AVs. The reason is that more available vehicles 
may result in more economical routes but the total distance 
travelled would not be shortened significantly. 

C. Admission Control in Consecutive Operating Intervals 

Here we consider operating the system consecutively for a 
period of time to entertain the 100 requests in the transporta¬ 
tion request pool. We consider two cases of different operating 
interval durations. In Case 1, there are 10 intervals, in each of 
which 10 random requests from the pool are to be scheduled. 
If a request is successfully admitted in an interval, it will be 
eliminated from the pool. Otherwise, it will be considered 
again in the subsequent intervals. The setting for Case 2 is 
similar but we consider total 20 intervals with 5 requests being 
processed in each interval. Five vehicles are arranged to serve 
the requests in both cases and we apply our algorithm to each 
interval for admission control. In other words, we perform 
10 and 20 admission control processes in Cases 1 and 2, 
respectively. 
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Fig. 7. Evolutions of the algorithm in solving admission control. 


Fig. 1^ shows the proht accumulated along the intervals, in 
which we consider the duration of one interval for Case 1 
is that of two intervals for Case 2. Note that the cost is the 
actual expense on gas based on the traversed distance and the 
revenue gained from serving each request is the discounted 
result of having 50% off from the real fare as if the request 
would be served by a normal taxi in Boston. The discount 
is used to compensate for the inconvenience of ride sharing 
and possibly longer ride time. This discount rate may be 
already attractive to many people to adopt our system instead 
of the normal taxi service. Hence the profit shown can be 
projected to a real business running in a similar scale. Fig.[T0| 
provides the numbers of successfully admitted requests along 
the same interval horizon as in Fig. We can see that Case 
2 can produce more proht by successfully admitting more 
transportation requests. With the same number of vehicles in 
service, the smaller the number of requests to be scheduled in 
an interval, the higher the success rate of admission control is. 
In real situation, we normally cannot dramatically increase the 
size of the AV heet and we would not intentionally reduce the 
number of AVs in service. On the other hand, it is much easier 
to adjust the number of requests to be scheduled each time by 
controlling the duration of each operating interval. In general, 
the shorter the interval, the smaller number of requests there 
are. Therefore, we would suggest to set the operating interval 
shorter, resulting in fewer requests to be scheduled each time 
and higher profits. Moreover, this will make the scheduling 
problem smaller by requiring shorter computation time to run 
the algorithm. 


VIII. Conclusion 

With advancements in technologies, AVs become feasible 
and can run on the roads. Various vehicular wireless communi¬ 
cation technologies allow AVs to be connected and respond co¬ 
operatively to instantaneous situations. This constitutes a new 
form of public transport with high efficiency and flexibility. 
In this paper, we propose the AV public transportation system 
supporting point-to-point services with ride sharing capability. 
The system manages a fleet of AVs and accommodates a 
number of transportation requests. We focus on two major 
problems in the system: scheduling and admission control. 
The former is to conhgure the most economical schedules and 
routes for the AVs in order to satisfy the admissible requests. 
The latter is to determine the set of admissible requests 
among all requests so as to produce maximum profit. We 
formulate the scheduling problem as an MILP. The admission 
control problem is cast as a bilevel optimization problem, 
in which the scheduling problem is set as a constraint. We 
propose a GA-based solution method to address admission 
control. We perform a series of simulations with a real taxi 
service dataset recorded in Boston and the simulation results 
show that our solution method is effective in solving the 
problem. By shortening the operating intervals, the system 
can curtail the computation time required to solve the problem 
by limiting the quantity of the submitted requests and it can 
also produce higher profit cumulatively. To summarize, our 
contributions in this paper include: (i) designing the AV public 
transportation system, (ii) formulating the scheduling problem, 
(iii) developing distributed scheduling, (iv) formulating the 
admission control problem, (v) introducing the concept of 
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admissibility and deriving the related analytical results, (vi) 
proposing an effective method to solve the admission control 
problem, and (vii) validating the performance of the solution 
method with real-world transportation service data. 
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